Evaluating automatic speech recognition as a component of a multi-input device human-computer interface
نویسندگان
چکیده
This paper reports on an investigation into the basic properties and requirements of automatic speech recognition as an input device to a trial human computer interface. The experiments required subjects to carry out a simulated target acquisition and report filling task, with the available input devices being automatic speech recognition, trackball, function keys or a simultaneous combination of all three. Experiments were carried out under varying workload to examine the degradation of overall interface and individual input device performance under user stress. An approach at modelling interface performance using a critical path analysis approach is introduced. Modelling of the interface developed here has shown a good match to the experimental results. Although use of the prototype speech recogniser was found to be both slower and less accurate than the manual mode inputs it was possible to estimate a required word accuracy of around 94% which would allow speech entry to provide an equivalent performance.
منابع مشابه
Input and output modalities used in a sign-language-enabled information kiosk
This paper presents description and evaluation of input and output modalities used in a sign-language-enabled information kiosk. The kiosk was developed for experiments on interaction between computers and deaf users. The input modalities are automatic computer-vision-based sign language recognition, automatic speech recognition (ASR) and a touchscreen. The output modalities are presented on a ...
متن کاملRecent Advances in the Automatic Recognition of Audio-Visual Speech
Visual speech information from the speaker’s mouth region has been successfully shown to improve noise robustness of automatic speech recognizers, thus promising to extend their usability in the human computer interface. In this paper, we review the main components of audio-visual automatic speech recognition and present novel contributions in two main areas: First, the visual front end design,...
متن کاملDesigning and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملA Survey on Speech Recognition
The Speech is most prominent & primary mode of Communication among of human being. The communication among human computer interaction is called human computer interface. Speech has potential of being important mode of interaction with computer. Speech recognition is the process of the computer identifying human speech to generate a string of words or commands. The output of speech recognition s...
متن کاملVideo-based face recognition in color space by graph-based discriminant analysis
Video-based face recognition has attracted significant attention in many applications such as media technology, network security, human-machine interfaces, and automatic access control system in the past decade. The usual way for face recognition is based upon the grayscale image produced by combining the three color component images. In this work, we consider grayscale image as well as color s...
متن کامل